A note on Talagrand's positivity principle.
Talagrand's positivity principle, Section 6.6 in [8], states that one can slightly perturb
a Hamiltonian in the Sherrington-Kirkpatrick model in such a way that the overlap
of two configurations under the perturbed Gibbs' measure will become typically nonnegative.
In this note we observe that abstracting from the setting of the SK model only
improves the result and does not require any modifications in Talagrand's argument. In
this version, for example, the positivity principle immediately applies to the setting in [9].
Also, abstracting from the SK model improves the conditions in the Ghirlanda-Guerra
identities and, as a consequence, results in a perturbation of smaller order necessary to
ensure positivity of the overlap.
1 Introduction.

Let us consider the unit sphere $S = \{z \in \mathbb{R}^N : |z| = 1\}$ in Euclidean space $\mathbb{R}^N$ and let $\nu$ be a
probability measure on $S$. Given a measurable function $g : S \to \mathbb{R}$ let us define a probability
measure $\nu_g$ on $S$ by a change of density

\[ \frac{d\nu_g}{d\nu}(z) = \frac{e^{g(z)}}{\int e^{g(z)}\, d\nu(z)}. \]

We assume that $e^{g}$ is $\nu$-integrable, so that the denominator on the right hand side is finite.
Let us make a specific choice of $g(z)$ given by

\[ g(z) = v \sum_{p \ge 1} 2^{-p} x_p\, g_p(z) \quad \text{for} \quad g_p(z) = \sum_{1 \le i_1, \ldots, i_p \le N} g_{i_1, \ldots, i_p}\, z_{i_1} \cdots z_{i_p}, \tag{1.1} \]

where $v > 0$, $(x_p)$ are i.i.d. random variables uniform on $[0, 1]$ and $g_{i_1, \ldots, i_p}$ are i.i.d. standard
Gaussian for all $i_1, \ldots, i_p$ and all $p \ge 1$.
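The covariance of the process $g$ is used repeatedly in the proofs, so it may help to record it here. The following is a short worked computation under the definition (1.1), writing $\mathbb{E}_g$ for the expectation over the Gaussian coefficients only, with $(x_p)$ fixed; the bound $v^2/3$ on the last line is our own arithmetic and is not stated in this form in the text.

```latex
% Covariance of g_p: the coefficients g_{i_1,...,i_p} are i.i.d. standard Gaussian, so
\[
\mathbb{E}_g\, g_p(z^1) g_p(z^2)
  = \sum_{1 \le i_1,\ldots,i_p \le N} z^1_{i_1}\cdots z^1_{i_p}\, z^2_{i_1}\cdots z^2_{i_p}
  = \Bigl(\sum_{i \le N} z^1_i z^2_i\Bigr)^{p}
  = (z^1 \cdot z^2)^p .
\]
% The processes g_p are independent for different p, hence
\[
\mathbb{E}_g\, g(z^1) g(z^2) = v^2 \sum_{p \ge 1} 2^{-2p} x_p^2\, (z^1 \cdot z^2)^p .
\]
% On the unit sphere z . z = 1, so for x_p in [0,1]:
\[
\mathbb{E}_g\, g(z)^2 = v^2 \sum_{p \ge 1} 4^{-p} x_p^2 \le v^2 \sum_{p \ge 1} 4^{-p} = \frac{v^2}{3}.
\]
```

In particular the size of the perturbation at a fixed point $z$ is of order $v$, uniformly in $N$.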
Given a function $f$ on $S^n$ let us denote by $\langle f \rangle$ its average with respect to the measure $\nu_g^{\otimes n}$.
Let us denote by $\mathbb{E}_g$ the expectation with respect to Gaussian random variables and by $\mathbb{E}_x$
the expectation with respect to uniform random variables in the definition of $g(z)$ in (1.1).
The following is the main result of this note.

Theorem 1 (Talagrand's positivity principle) For any $\varepsilon > 0$ there exists large enough $v > 1$
in (1.1) such that

\[ \mathbb{E}\, \nu_g^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\} \le \varepsilon. \tag{1.2} \]

The choice of $v$ does not depend on $N$ and $\nu$.

This means that one can define a random perturbation $\nu_g$ of an arbitrary measure $\nu$
such that the scalar product $z^1 \cdot z^2$ of two vectors drawn independently from the distribution
$\nu_g$ will be typically nonnegative. This result was proved in Section 6.6 of [8] in the setting of
the Sherrington-Kirkpatrick model, where $\nu$ was a random Gibbs' measure and in which case
the expectation on the left hand side of (1.2) was also in the randomness of $\nu$. The main
ingredient of the proof was the Ghirlanda-Guerra identities, which are typically given in the
setting of the SK model as well.

The main contribution of this note is an observation that abstracting ourselves from the
setting of the SK model results in some qualitative improvements of the positivity principle
of Theorem 1. First of all, we notice that Talagrand's proof in [8] requires no modifications in
order to prove a positivity principle that holds uniformly over $\nu$ rather than on average over
a random Gibbs' measure in the SK model. This observation, for example, implies the result
in [9] without any additional work, as will be shown in the Example below. Another important
qualitative improvement is the fact that the choice of $v$ in (1.2) is independent of $N$. In [8],
one needed $v \gg N^{1/4}$, a condition that appears in the proof of Ghirlanda-Guerra identities
due to the fact that one controls a random Gibbs' measure from the very beginning. We will
show below that the supremum of $g(z)$ on the sphere is of order $v\sqrt{N}$, which means that one
can perturb any measure on $S$ by a change of density of order $\exp(v\sqrt{N})$ and force the scalar
product $z^1 \cdot z^2$ to be essentially nonnegative.

Example (Positivity in Guerra's replica symmetry breaking bound, [9]). The main result
in [9] states that Guerra's replica symmetry breaking bound [3] applies to odd $p$-spin
interactions as well. The proof utilizes the Aizenman-Sims-Starr version of Guerra's
interpolation [1] and a positivity principle that requires a concentration inequality for the free
energy along the interpolation. We observe that this positivity principle follows directly from
Theorem 1. Let $A$ be a countable set and let $(w_\alpha)$ be a probability function on $A$ such that
$w_\alpha > 0$, $\sum_{\alpha \in A} w_\alpha = 1$. Let $H(z, \alpha)$ be a function on $\Omega = \Sigma \times A$ for some finite subset $\Sigma$ of
$S$. Let us consider a probability measure on $\Omega$ given by

\[ \mu\{(z, \alpha)\} \sim w_\alpha \exp\bigl(H(z, \alpha) + g(z)\bigr), \]

where $g(z)$ is defined in (1.1). Then its marginal on $\Sigma$ is equal to $\nu_g$ if we define $\nu$ by
$\nu\{z\} \sim \sum_{\alpha \in A} w_\alpha \exp H(z, \alpha)$. Therefore, the scalar product $z^1 \cdot z^2$ has the same
distribution under $\mu^{\otimes 2}$ as under $\nu_g^{\otimes 2}$. By Theorem 1, for large enough $v > 0$,

\[ \mathbb{E}\, \mu^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\} \le \varepsilon \]

and this inequality holds uniformly over all choices of $H$ and $w$. Therefore, we can average
over an arbitrary random distribution of $H$ and $w$. In particular, in [9], $(w_\alpha)$ was a random
Derrida-Ruelle process on $A = \mathbb{N}^k$ and $H$ was Guerra's interpolating Hamiltonian in the
form of Aizenman-Sims-Starr.

□ 

General remarks. (i) As we will see below, Talagrand's proof of the positivity principle uses
very deep information about the joint distribution of $z^1 \cdot z^l$ for $2 \le l \le n$ under the measure $\mathbb{E}_g \nu_g^{\otimes n}$
for a typical realization of $(x_p)$. However, it is not clear whether this deep information is really
necessary to prove positivity and whether some simpler argument would not suffice. For example,
if we consider only the first order Gaussian term in (1.1) and define $g'(z) = v \sum_{i \le N} g_i z_i$, then the
measure $\nu_{g'}$ under a change of density proportional to $e^{g'(z)}$ would favor a random direction
$g = (g_1, \ldots, g_N)$ and it is conceivable that for large enough $v$ independent of $\nu$ and $N$
two independent vectors $z^1$ and $z^2$ from this measure would typically "point in the same
direction", i.e.

\[ \mathbb{E}\, \nu_{g'}^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\} = \mathbb{E}\, \frac{\int I(z^1 \cdot z^2 \le -\varepsilon)\, e^{g'(z^1) + g'(z^2)}\, d\nu(z^1)\, d\nu(z^2)}{\bigl(\int e^{g'(z)}\, d\nu(z)\bigr)^2} \le \varepsilon. \]

In fact, even a weaker result with $v = o(\sqrt{N})$ would be sufficient for applications as in [9].
(ii) Theorem 1 implies the following non-random version of the positivity principle.

Corollary 1 For any $\varepsilon > 0$ there exists $v > 0$ large enough such that the following holds.
For any distribution $Q$ on the set of measures on the sphere $S$ there exists a (non-random)
function $g(z)$ such that for some absolute constant $L$,

\[ \sup_{z \in S} |g(z)| \le L v \sqrt{N} \quad \text{and} \quad \int \nu_g^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\}\, dQ(\nu) \le \varepsilon. \]

It would be of interest to prove this result directly and not as a corollary of Theorem 1.
Then, by the Hahn-Banach theorem, one can find a distribution $P$ on the set of functions
$\{\sup_{z \in S} |g(z)| \le L v \sqrt{N}\}$ such that for all probability measures $\nu$ on $S$,

\[ \int \nu_g^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\}\, dP(g) \le \varepsilon. \]

This would give another proof of Theorem 1 with a non-constructive description of $P$.

□ 

Sketch of the proof of positivity principle. The main ingredient in Talagrand's proof is
the extended Ghirlanda-Guerra identities (Section 6.4 in [8]) that state that if we sample
$(z^1, \ldots, z^{n+1})$ from the measure $\mathbb{E}_g \nu_g^{\otimes (n+1)}$ (which is a mixture of product measures of $\nu_g$ over
the randomness of Gaussian r.v.s) then for a typical realization of $(x_p)$ the scalar product
$z^1 \cdot z^{n+1}$ with probability $1/n$ is independent of $z^1 \cdot z^l$, $2 \le l \le n$, and with probabilities $1/n$
it is equal to one of them. More precisely, the following holds.
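Theorem 2 (Ghirlanda-Guerra identities) For any measurable function $f$ on $S^n$ such that
$|f| \le 1$ and for any continuous function $\psi$ on $[-1, 1]$ we have

\[ \mathbb{E}_x \Bigl| \mathbb{E}_g \langle f \psi(z^1 \cdot z^{n+1}) \rangle - \frac{1}{n}\, \mathbb{E}_g \langle f \rangle\, \mathbb{E}_g \langle \psi(z^1 \cdot z^2) \rangle - \frac{1}{n} \sum_{2 \le l \le n} \mathbb{E}_g \langle f \psi(z^1 \cdot z^l) \rangle \Bigr| \le \delta(\psi, n, v) \tag{1.3} \]

where $\delta(\psi, n, v) \to 0$ as $v \to \infty$ and $\delta$ does not depend on $N$ and $\nu$.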

The main idea of Talagrand's proof can be briefly described as follows. Suppose that we
sample $z^1, \ldots, z^n$ independently from any measure on $S$. Then the event that all $z^1 \cdot z^l \le -\varepsilon$
simultaneously is very unlikely and its probability is of order $1/(n\varepsilon)$. The bound on this
probability is uniform over all measures and therefore can be averaged over some distribution
on measures and holds, for example, for $\mathbb{E}_g \nu_g^{\otimes n}$. On the other hand, by Ghirlanda-Guerra
identities, under the measure $\mathbb{E}_g \nu_g^{\otimes n}$ the events $\{z^1 \cdot z^l \le -\varepsilon\}$ are strongly correlated due to
the fact that with some prescribed probabilities the $z^1 \cdot z^l$ can be equal. As a consequence, the
simultaneous occurrence of these events will have probability of order $a/n^{1-a}$, where $a$ is the
probability of one of them. But since this quantity is bounded by $1/(n\varepsilon)$, taking $n$ large
enough shows that $a$ should be small enough.
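To see why comparing the two orders forces $a$ to be small, here is a rough worked version of the last step. It is only a heuristic sketch: the constant $C$ is a placeholder of ours, and logarithmic factors and the error term from the Ghirlanda-Guerra identities are ignored.

```latex
% Heuristic comparison of the lower bound a / n^{1-a} with the upper bound C / (n eps).
\[
\frac{a}{n^{1-a}} \le \frac{C}{n\varepsilon}
\quad\Longleftrightarrow\quad
a\, n^{a} \le \frac{C}{\varepsilon}.
\]
% Fix a threshold a_0 > 0. If a >= a_0 then a n^a >= a_0 n^{a_0}, so we would need
\[
a_0\, n^{a_0} \le \frac{C}{\varepsilon},
\qquad\text{i.e.}\qquad
n \le \Bigl(\frac{C}{a_0\, \varepsilon}\Bigr)^{1/a_0}.
\]
% This fails as soon as n exceeds the right hand side, so for such n we must have a < a_0.
```

Since $a_0 > 0$ was arbitrary, this suggests that $a$ can be made as small as desired once $n$ is large.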



2 Proofs. 



The proof of Theorem 2 follows exactly the same argument as the proof of Ghirlanda-Guerra
identities in [2] or in [8]. Since we consider a fixed measure $\nu$, we do not need to control
fluctuations of a random Hamiltonian and, as a result, we get a better condition on $v$. The
main part of the proof is the following lemma.

Lemma 1 For any $p \ge 1$ there exists a constant $L_p$ that depends on $p$ only such that

\[ \mathbb{E} \bigl\langle |g_p(z) - \mathbb{E}_g \langle g_p(z) \rangle| \bigr\rangle \le L_p \sqrt{v}. \tag{2.1} \]

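Proof. Let us fix $(x_p)_{p \ge 1}$ in the definition of $g(z)$ and until the end of the proof let $\mathbb{E}$ denote
the expectation in Gaussian random variables only. Define

\[ \theta = \log \int e^{g(z)}\, d\nu(z), \qquad \psi = \mathbb{E}\theta. \]

Given $p \ge 1$, let us think of $\theta$ and $\psi$ as functions of $x = x_p$ only and define $v_p = v 2^{-p}$. Then
$\theta'(x) = v_p \langle g_p(z) \rangle$ and

\[ \psi'(x) = v_p\, \mathbb{E} \langle g_p(z) \rangle = v_p^2\, x \bigl(1 - \mathbb{E} \langle (z^1 \cdot z^2)^p \rangle\bigr) \le v_p^2\, x. \tag{2.2} \]

Since

\[ \psi''(x) = v_p^2\, \mathbb{E} \bigl(\langle g_p(z)^2 \rangle - \langle g_p(z) \rangle^2\bigr) = v_p^2\, \mathbb{E} \bigl\langle (g_p(z) - \langle g_p(z) \rangle)^2 \bigr\rangle, \]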
we have

\[ v_p^2 \int_0^1 \mathbb{E} \bigl\langle (g_p(z) - \langle g_p(z) \rangle)^2 \bigr\rangle\, dx = \psi'(1) - \psi'(0) \le v_p^2 \]

and by Cauchy inequality

\[ \int_0^1 \mathbb{E} \bigl\langle |g_p(z) - \langle g_p(z) \rangle| \bigr\rangle\, dx \le 1. \tag{2.3} \]

To prove (2.1) it remains to approximate $\langle g_p(z) \rangle$ by $\mathbb{E} \langle g_p(z) \rangle$ and to achieve that we will use
a simple consequence of convexity of $\theta$ and $\psi$ given in the inequality (2.5) below. Since

\[ \mathbb{E}\, g(z)^2 = v^2 \sum_{p \ge 1} 2^{-2p} x_p^2 \le 2 v^2 \]

if all $|x_p| \le 2$, we can apply a well-known Gaussian concentration inequality, for completeness
given in Lemma 3 in Appendix A, to get

\[ \mathbb{E} |\theta(x) - \psi(x)| \le 4v \quad \text{and} \quad \mathbb{E} |\theta(x \pm y) - \psi(x \pm y)| \le 4v. \]

Below we will choose $|y| \le 1$ so that $|x \pm y| \le 2$. (Remark. At the same step in the setting
of the SK model one also needs to control a random Hamiltonian, which produces another
term of order $\sqrt{N}$, and this results in an unnecessary condition on $v$ in the positivity principle.)
Inequality (2.5) then implies that

\[ \mathbb{E} |\theta'(x) - \psi'(x)| \le \psi'(x + y) - \psi'(x - y) + \frac{12 v}{y}. \]

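Since

\[ \int_0^1 \bigl(\psi'(x + y) - \psi'(x - y)\bigr)\, dx = \psi(1 + y) - \psi(1 - y) - \psi(y) + \psi(-y) \le 6 v_p^2\, y \]

using (2.2) and $|y| \le 1$, we get

\[ \int_0^1 \mathbb{E} |\theta'(x) - \psi'(x)|\, dx \le 6 v_p^2\, y + \frac{12 v}{y} \le L_p v^{3/2} \]

if we take $y = v^{-1/2} \le 1$. Using explicit expressions for $\theta'$ and $\psi'$ we finally get

\[ \int_0^1 \mathbb{E} \bigl| \langle g_p(z) \rangle - \mathbb{E} \langle g_p(z) \rangle \bigr|\, dx \le L_p \sqrt{v}. \tag{2.4} \]

Together with (2.3) this gives

\[ \int_0^1 \mathbb{E} \bigl\langle |g_p(z) - \mathbb{E} \langle g_p(z) \rangle| \bigr\rangle\, dx \le L_p \sqrt{v}. \]

We now recall that $\mathbb{E}$ was the expectation $\mathbb{E}_g$ with respect to Gaussian random variables only.
Also, the integral over $x = x_p \in [0, 1]$ is nothing but the expectation with respect to $x_p$. Therefore,
averaging over all remaining $x_p$ finishes the proof.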

□ 
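The choice $y = v^{-1/2}$ in the proof of Lemma 1 can be checked by direct arithmetic. The sketch below assumes the bound $6 v_p^2 y$ on the integrated derivative difference and a total error of $12v/y$ (three terms of size $4v$ each, divided by $y$); the explicit value of the constant on the last line is our own bookkeeping and is not claimed to be the one intended in the text.

```latex
% With v_p = v 2^{-p}, y = v^{-1/2} and v >= 1 (so that y <= 1):
\[
6 v_p^2\, y + \frac{12 v}{y}
  = 6 \cdot 4^{-p}\, v^{2} v^{-1/2} + 12\, v \cdot v^{1/2}
  = \bigl(6 \cdot 4^{-p} + 12\bigr)\, v^{3/2}.
\]
% Since theta'(x) - psi'(x) = v_p ( <g_p(z)> - E<g_p(z)> ), dividing by v_p = v 2^{-p} gives
\[
\int_0^1 \mathbb{E}\bigl|\langle g_p(z)\rangle - \mathbb{E}\langle g_p(z)\rangle\bigr|\, dx
  \le 2^{p}\bigl(6 \cdot 4^{-p} + 12\bigr)\sqrt{v}
  = \bigl(6 \cdot 2^{-p} + 12 \cdot 2^{p}\bigr)\sqrt{v}.
\]
% Adding the bound 1 <= sqrt(v) from the Cauchy inequality step and using the triangle inequality:
\[
\int_0^1 \mathbb{E}\bigl\langle |g_p(z) - \mathbb{E}\langle g_p(z)\rangle| \bigr\rangle\, dx
  \le \bigl(1 + 6 \cdot 2^{-p} + 12 \cdot 2^{p}\bigr)\sqrt{v}.
\]
```

Under these assumptions one admissible value is $L_p = 1 + 6 \cdot 2^{-p} + 12 \cdot 2^{p}$, which depends on $p$ only.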

The following inequality was used in the previous proof and it quantifies the fact that 
if two convex functions are close to each other then their derivatives are also close. 
Lemma 2 If $\theta(x)$ and $\psi(x)$ are convex differentiable functions then for any $y > 0$

\[ |\theta'(x) - \psi'(x)| \le \psi'(x + y) - \psi'(x - y) + \frac{1}{y} \bigl( |\psi(x + y) - \theta(x + y)| + |\psi(x - y) - \theta(x - y)| + |\psi(x) - \theta(x)| \bigr). \tag{2.5} \]



Proof. By convexity, for any $y > 0$,

\[ \frac{\theta(x) - \theta(x - y)}{y} \le \theta'(x) \le \frac{\theta(x + y) - \theta(x)}{y} \]

and

\[ \frac{\psi(x) - \psi(x - y)}{y} \le \psi'(x) \le \frac{\psi(x + y) - \psi(x)}{y}. \]

If we define

\[ U = \frac{1}{y} \bigl( |\psi(x + y) - \theta(x + y)| + |\psi(x - y) - \theta(x - y)| + |\psi(x) - \theta(x)| \bigr) \]

then the above inequalities imply

\[ \theta'(x) \le \frac{\theta(x + y) - \theta(x)}{y} \le \frac{\psi(x + y) - \psi(x)}{y} + U \]
\[ = \frac{\psi(x) - \psi(x - y)}{y} + \frac{\psi(x + y) + \psi(x - y) - 2\psi(x)}{y} + U \]
\[ \le \psi'(x) + \psi'(x + y) - \psi'(x - y) + U. \]

Similarly,

\[ \theta'(x) \ge \psi'(x) - \bigl(\psi'(x + y) - \psi'(x - y)\bigr) - U. \]

Combining the two inequalities finishes the proof.

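Proof of Theorem 2. Since $|f| \le 1$ we can write

\[ \bigl| \mathbb{E}_g \langle f g_p(z) \rangle - \mathbb{E}_g \langle f \rangle\, \mathbb{E}_g \langle g_p(z) \rangle \bigr| \le \mathbb{E}_g \bigl\langle |g_p(z) - \mathbb{E}_g \langle g_p(z) \rangle| \bigr\rangle. \]

By Gaussian integration by parts, the left hand side is equal to $n v 2^{-p} x_p \phi_p$ where

\[ \phi_p = \Bigl| \mathbb{E}_g \langle f (z^1 \cdot z^{n+1})^p \rangle - \frac{1}{n}\, \mathbb{E}_g \langle f \rangle\, \mathbb{E}_g \langle (z^1 \cdot z^2)^p \rangle - \frac{1}{n} \sum_{2 \le l \le n} \mathbb{E}_g \langle f (z^1 \cdot z^l)^p \rangle \Bigr|. \]

Lemma 1 then implies $n v 2^{-p}\, \mathbb{E}_x x_p \phi_p \le L_p \sqrt{v}$ and, thus,

\[ \mathbb{E}_x x_p \phi_p \le L v^{-1/2} \]

for some constant $L$ that depends only on $n$ and $p$. Since $\phi_p \le 2$, for any $x_0 \in (0, 1)$,

\[ \mathbb{E}_x \phi_p \le 2 x_0 + \frac{1}{x_0}\, \mathbb{E}_x x_p \phi_p \le 2 x_0 + \frac{L v^{-1/2}}{x_0} \]

and minimizing over $x_0$ we get $\mathbb{E}_x \phi_p(x_p) \le L v^{-1/4}$. Since any continuous function $\psi$ on $[-1, 1]$
can be approximated uniformly by polynomials, this implies the result.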

□ 
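The minimization over $x_0$ at the end of the proof of Theorem 2 is elementary; the following worked step makes the resulting constant explicit. The function name $h$ is ours, and $L$ denotes whatever constant appears in the bound $\mathbb{E}_x x_p \phi_p \le L v^{-1/2}$.

```latex
% Minimize h(x_0) = 2 x_0 + c / x_0 over x_0 > 0, where c = L v^{-1/2}.
\[
h'(x_0) = 2 - \frac{c}{x_0^2} = 0
\quad\Longrightarrow\quad
x_0 = \sqrt{c/2} = \sqrt{L/2}\; v^{-1/4}.
\]
% Value at the minimizer:
\[
h\bigl(\sqrt{c/2}\bigr) = 2\sqrt{c/2} + \frac{c}{\sqrt{c/2}} = \sqrt{2c} + \sqrt{2c} = 2\sqrt{2L}\; v^{-1/4}.
\]
% The minimizer lies in (0,1) as soon as c < 2, i.e. v > L^2 / 4.
```

So for $v > L^2/4$ the bound holds with the constant $2\sqrt{2L}$ in front of $v^{-1/4}$.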

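Proof of Theorem 1. Step 1. First we use Ghirlanda-Guerra identities to give a lower
bound on the probability that all $z^1 \cdot z^l \le -\varepsilon$ for $2 \le l \le n$. In order to use Theorem 2 it will be
convenient to denote by $\delta$ any quantity that depends on $(x_p)$ and such that $\mathbb{E}_x |\delta|$ does not
depend on $\nu$ and $N$ and $\mathbb{E}_x |\delta| \to 0$ as $v \to \infty$. Then (1.3) can be written as

\[ \mathbb{E}_g \langle f \psi(z^1 \cdot z^{n+1}) \rangle - \frac{1}{n}\, \mathbb{E}_g \langle f \rangle\, \mathbb{E}_g \langle \psi(z^1 \cdot z^2) \rangle - \frac{1}{n} \sum_{2 \le l \le n} \mathbb{E}_g \langle f \psi(z^1 \cdot z^l) \rangle = \delta. \]

Even though the function $\psi$ is assumed to be continuous, let us use this result formally for
$\psi(x) = I(x \le -\varepsilon)$. The argument can be easily modified by using continuous approximations
of the indicator function $\psi$. Let

\[ f_n = \prod_{2 \le l \le n} \psi(z^1 \cdot z^l) = I(z^1 \cdot z^l \le -\varepsilon \text{ for } 2 \le l \le n) \]

and let

\[ a = \mathbb{E}_g \langle \psi(z^1 \cdot z^2) \rangle = \mathbb{E}_g \nu_g^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\}. \]

Then by Ghirlanda-Guerra identities

\[ \mathbb{E}_g \langle f_{n+1} \rangle = \mathbb{E}_g \langle f_n \psi(z^1 \cdot z^{n+1}) \rangle = \frac{1}{n}\, \mathbb{E}_g \langle f_n \rangle\, \mathbb{E}_g \langle \psi(z^1 \cdot z^2) \rangle + \frac{1}{n} \sum_{2 \le l \le n} \mathbb{E}_g \langle f_n \psi(z^1 \cdot z^l) \rangle + \delta \]

and since $f_n \psi(z^1 \cdot z^l) = f_n$ for $2 \le l \le n$ we get

\[ \mathbb{E}_g \langle f_{n+1} \rangle = \frac{n - 1 + a}{n}\, \mathbb{E}_g \langle f_n \rangle + \delta. \]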

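By induction,

\[ \mathbb{E}_g \langle f_n \rangle = a \prod_{2 \le l \le n-1} \frac{l - 1 + a}{l} + \delta \ge \frac{a}{L n^{1-a}} + \delta \tag{2.6} \]

where the last inequality follows from a simple estimate for $l \ge 2$

\[ \frac{l - 1 + a}{l} = 1 - \frac{1 - a}{l} \ge \exp\Bigl(-\frac{1 - a}{l - 1}\Bigr). \]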
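For completeness, here is one way to pass from the recursion $\mathbb{E}_g \langle f_{n+1} \rangle = \frac{n-1+a}{n} \mathbb{E}_g \langle f_n \rangle + \delta$ to a lower bound of order $a / n^{1-a}$. This is a sketch of a standard argument under the assumption $0 \le a \le 1$ and $n \ge 3$; the constant $e^{-1}$ is what this particular route gives and is not claimed to be the constant used in the text.

```latex
% For 0 <= u < 1 we have -log(1-u) = sum_{k>=1} u^k / k <= u / (1-u).
% With u = (1-a)/l and l >= 2 this gives
\[
-\log\Bigl(1 - \frac{1-a}{l}\Bigr) \le \frac{(1-a)/l}{1 - (1-a)/l} = \frac{1-a}{l - (1-a)} \le \frac{1-a}{l-1}.
\]
% Summing over 2 <= l <= n-1 and using the harmonic sum bound sum_{k=1}^{n-2} 1/k <= 1 + log n:
\[
\prod_{2 \le l \le n-1} \frac{l-1+a}{l}
  \ge \exp\Bigl(-(1-a) \sum_{k=1}^{n-2} \frac{1}{k}\Bigr)
  \ge \exp\bigl(-(1-a)(1 + \log n)\bigr)
  = \frac{e^{-(1-a)}}{n^{1-a}}
  \ge \frac{1}{e\, n^{1-a}}.
\]
```

Multiplying by $a$ gives a lower bound $a / (e\, n^{1-a})$ for the main term, uniformly in $a \in [0, 1]$.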

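Step 2. On the other hand, we will show that $\langle f_n \rangle$ is of order $1/(n\varepsilon)$ and to emphasize the
fact that this is true for any measure we now simply write $G$ instead of $\nu_g$. If $z^1, \ldots, z^n$ are
i.i.d. from the distribution $G$ then

\[ \langle f_n \rangle = \bigl\langle G\{z^2 : z^1 \cdot z^2 \le -\varepsilon\}^{n-1} \bigr\rangle \le G(U) + \gamma^{n-1} \tag{2.7} \]

where, given $0 < \gamma < 1$, we defined a set

\[ U = \bigl\{ z^1 \in S : G\{z^2 : z^1 \cdot z^2 \le -\varepsilon\} \ge \gamma \bigr\}. \]

We would like to show that if $\gamma$ is close to 1 then $G(U)$ is small. This follows from the fact
that the average of $z^1 \cdot z^2$ is nonnegative with respect to any product measure,

\[ \langle z^1 \cdot z^2 \rangle = \sum_{i \le N} \langle z_i^1 z_i^2 \rangle = \sum_{i \le N} \langle z_i \rangle^2 \ge 0. \]

Since $z^1 \cdot z^2 \le 1$,

\[ 0 \le \langle z^1 \cdot z^2 \rangle \le -\varepsilon\, G^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\} + G^{\otimes 2}\{z^1 \cdot z^2 > -\varepsilon\} = -\varepsilon + (1 + \varepsilon)\, G^{\otimes 2}\{z^1 \cdot z^2 > -\varepsilon\} \]

and

\[ \varepsilon \le (1 + \varepsilon)\, G^{\otimes 2}\{z^1 \cdot z^2 > -\varepsilon\}. \tag{2.8} \]

Let us define a conditional distribution $G_U(C) = G(U \cap C)/G(U)$. If $z^1 \in U$ then by definition
$G\{z^2 : z^1 \cdot z^2 > -\varepsilon\} \le 1 - \gamma$ and

\[ G_U\{z^2 : z^1 \cdot z^2 > -\varepsilon\} \le \frac{1 - \gamma}{G(U)}. \]

Since $G_U$ is concentrated on $U$,

\[ G_U^{\otimes 2}\{z^1 \cdot z^2 > -\varepsilon\} \le \frac{1 - \gamma}{G(U)}. \]

Using (2.8) for $G_U$ instead of $G$ we get

\[ \varepsilon \le (1 + \varepsilon)\, G_U^{\otimes 2}\{z^1 \cdot z^2 > -\varepsilon\} \le \frac{2(1 - \gamma)}{G(U)} \]

and, therefore, $G(U) \le 2(1 - \gamma)/\varepsilon$. Then, (2.7) implies that

\[ \langle f_n \rangle \le \frac{2(1 - \gamma)}{\varepsilon} + \gamma^{n-1} \le \frac{2(1 - \gamma)}{\varepsilon} + e^{-(n-1)(1-\gamma)} \]

and we can minimize this bound over $0 < \gamma < 1$ to get

\[ \langle f_n \rangle \le \frac{L}{n\varepsilon} \log(n\varepsilon). \]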
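The minimization over $\gamma$ in Step 2 can be carried out explicitly. The sketch below uses the substitution $t = 1 - \gamma$ and assumes $(n-1)\varepsilon > 2$ so that the minimizer is positive; the symbols $B$ and $t^*$ are ours.

```latex
% Bound to minimize, with t = 1 - gamma in (0,1):
\[
B(t) = \frac{2t}{\varepsilon} + e^{-(n-1)t},
\qquad
B'(t) = \frac{2}{\varepsilon} - (n-1)\, e^{-(n-1)t}.
\]
% Setting B'(t) = 0:
\[
e^{-(n-1)t^*} = \frac{2}{(n-1)\varepsilon},
\qquad
t^* = \frac{1}{n-1} \log\frac{(n-1)\varepsilon}{2}.
\]
% Here 0 < t^* < 1 because 1 < (n-1) eps / 2 <= (n-1)/2 < e^{n-1}.  Value at the minimizer:
\[
B(t^*) = \frac{2}{(n-1)\varepsilon}\Bigl(\log\frac{(n-1)\varepsilon}{2} + 1\Bigr).
\]
```

For $n\varepsilon$ bounded away from 1 this is at most a constant multiple of $\frac{1}{n\varepsilon}\log(n\varepsilon)$, which is the order of the bound stated in Step 2.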

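Step 3. Together with (2.6) this implies

\[ \frac{a}{L n^{1-a}} \le \frac{L}{n\varepsilon} \log(n\varepsilon) + \delta. \]

By definition of $\delta$, for fixed $n$ this implies that

\[ \mathbb{E}_x\, a n^{a} \le \frac{L}{\varepsilon} \log(n\varepsilon) + \mathbb{E}_x |\delta| \]

where $\mathbb{E}_x |\delta| \to 0$ as $v \to \infty$. For any $a_0 > 0$ we can write

\[ \mathbb{E}_x a \le a_0 + \mathbb{E}_x\, a I(a > a_0) \le a_0 + \frac{1}{n^{a_0}}\, \mathbb{E}_x\, a n^{a} \le a_0 + \frac{1}{n^{a_0}} \Bigl( \frac{L}{\varepsilon} \log(n\varepsilon) + \mathbb{E}_x |\delta| \Bigr). \]

The right hand side goes to zero if we let $v \to \infty$, $n \to \infty$, $a_0 \to 0$, which means that $\mathbb{E}_x a \to 0$
as $v \to \infty$ and this finishes the proof.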

□ 

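Proof of Corollary 1. Since (1.2) is uniform in $\nu$, by Chebyshev's inequality

\[ \mathbb{P} \Bigl( \int \nu_g^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\}\, dQ(\nu) \ge 4\varepsilon \Bigr) \le \frac{1}{4}. \tag{2.9} \]

Next, we will show that for some absolute constant $L$,

\[ \mathbb{P} \Bigl( \sup_{z \in S} |g(z)| \ge L v \sqrt{N} \Bigr) \le \frac{1}{4}. \tag{2.10} \]

Indeed, conditionally on $(x_p)$ the process $g(z)$ is Gaussian with covariance

\[ \mathbb{E}_g\, g(z^1) g(z^2) = v^2 \xi(z^1 \cdot z^2) \quad \text{where} \quad \xi(s) = \sum_{p \ge 1} 2^{-2p} x_p^2 s^p. \]

Since the first two derivatives of $\xi(s)$ are bounded uniformly over $(x_p) \in [0, 1]^\infty$,

\[ \mathbb{E}_g \bigl(g(z^1) - g(z^2)\bigr)^2 = v^2 \bigl( \xi(z^1 \cdot z^1) + \xi(z^2 \cdot z^2) - 2\xi(z^1 \cdot z^2) \bigr) \le L v^2 |z^1 - z^2|^2 = L v^2\, \mathbb{E} \bigl(\eta(z^1) - \eta(z^2)\bigr)^2 \]

where $\eta$ is the canonical Gaussian process $\eta(z) = \sum_{i \le N} g_i z_i$. By a consequence of Slepian's
inequality (see Corollary 3.14 in [4])

\[ \mathbb{E}_g \sup_{z \in S} g(z) \le L v\, \mathbb{E} \sup_{z \in S} \eta(z) \le L v \sqrt{N} \]

and, by symmetry, $\mathbb{E}_g \sup_{z \in S} |g(z)| \le L v \sqrt{N}$, which implies (2.10). (2.9) and (2.10) imply
that the event

\[ \Bigl\{ \int \nu_g^{\otimes 2}\{z^1 \cdot z^2 \le -\varepsilon\}\, dQ(\nu) \le 4\varepsilon \Bigr\} \cap \Bigl\{ \sup_{z \in S} |g(z)| \le L v \sqrt{N} \Bigr\} \]

is not empty and this finishes the proof.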

□ 
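The increment bound used in the proof of Corollary 1 can be made explicit. The following is a worked sketch assuming the covariance function $\xi(s) = \sum_{p \ge 1} 2^{-2p} x_p^2 s^p$ with $x_p \in [0, 1]$ and $|z^1| = |z^2| = 1$; the numerical constant $4/9$ is our own computation and the text only needs some absolute constant $L$.

```latex
% On the unit sphere, |z^1 - z^2|^2 = 2 - 2 z^1 . z^2, so with s = z^1 . z^2 in [-1,1]:
\[
\mathbb{E}_g\bigl(g(z^1) - g(z^2)\bigr)^2 = 2 v^2 \bigl(\xi(1) - \xi(s)\bigr),
\qquad
1 - s = \tfrac{1}{2}\, |z^1 - z^2|^2 .
\]
% Derivative bound, uniform over x_p in [0,1] and |c| <= 1, using sum_{p>=1} p r^p = r/(1-r)^2 at r = 1/4:
\[
|\xi'(c)| = \Bigl|\sum_{p \ge 1} p\, 4^{-p} x_p^2\, c^{p-1}\Bigr|
  \le \sum_{p \ge 1} p\, 4^{-p} = \frac{1/4}{(1 - 1/4)^2} = \frac{4}{9}.
\]
% Mean value theorem: xi(1) - xi(s) = xi'(c)(1 - s) for some c between s and 1, hence
\[
\mathbb{E}_g\bigl(g(z^1) - g(z^2)\bigr)^2
  \le 2 v^2 \cdot \frac{4}{9}\,(1 - s)
  = \frac{4}{9}\, v^2\, |z^1 - z^2|^2 .
\]
```

Only the first derivative of $\xi$ is needed for this particular route to the increment bound.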
A A Gaussian concentration inequality.

For completeness, we give a proof of the Gaussian concentration inequality which can be
found, for example, in [5].

Lemma 3 Let $\nu$ be a finite measure and $g(z)$ be a Gaussian process on $\mathbb{R}^N$ such that
$\mathbb{E}\, g(z)^2 \le a$ for $z$ in the support of the measure $\nu$. If

\[ X = \log \int \exp g(z)\, d\nu(z) \]

then $\mathbb{E}(X - \mathbb{E}X)^2 \le 8a$.

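Proof. Let $g^1$ and $g^2$ be two independent copies of $g$. For $t \in [0, 1]$ and $j = 1, 2$ we define

\[ g_t^j(z) = \sqrt{t}\, g^j(z) + \sqrt{1 - t}\, g(z) \quad \text{and} \quad g_t(z^1, z^2) = g_t^1(z^1) + g_t^2(z^2). \]

Let

\[ F_j = \log \int \exp g_t^j(z)\, d\nu(z) \tag{A.1} \]

and let $F = F_1 + F_2$. For $s \ge 0$, let $\varphi(t) = \mathbb{E} \exp s(F_1 - F_2)$. We can write

\[ \varphi'(t) = s\, \mathbb{E} \exp s(F_1 - F_2) \sum_{j \le 2} (-1)^{j+1} \exp(-F_j) \int \frac{\partial g_t^j(z)}{\partial t} \exp g_t^j(z)\, d\nu(z). \]

If $\zeta(z^1, z^2) := \mathbb{E}\, g(z^1) g(z^2)$ then

\[ \mathbb{E}\, \frac{\partial g_t^1(z^1)}{\partial t}\, g_t^2(z^2) = -\frac{1}{2}\, \zeta(z^1, z^2) \]

and

\[ \mathbb{E}\, \frac{\partial g_t^j(z^1)}{\partial t}\, g_t^j(z^2) = \frac{1}{2} \bigl( \zeta(z^1, z^2) - \zeta(z^1, z^2) \bigr) = 0. \]

Therefore, Gaussian integration by parts gives

\[ \varphi'(t) = s^2\, \mathbb{E} \exp s(F_1 - F_2) \exp(-F) \int \zeta(z^1, z^2) \exp g_t(z^1, z^2)\, d\nu(z^1)\, d\nu(z^2) \]

and since by assumption $|\zeta(z^1, z^2)| \le a$ on the support of the measure $\nu^{\otimes 2}$ we get

\[ \varphi'(t) \le a s^2\, \mathbb{E} \exp s(F_1 - F_2) \exp(-F) \int \exp g_t(z^1, z^2)\, d\nu(z^1)\, d\nu(z^2) = a s^2 \varphi(t). \]

By construction $\varphi(0) = 1$ and the above inequality implies that $\varphi(1) \le \exp a s^2$. On the
other hand, by construction, $\varphi(1) = \mathbb{E} \exp s(X - X')$, where $X'$ is an independent copy of
$X$. Thus,

\[ \mathbb{E} \exp s(X - X') \le \exp a s^2. \]

By Jensen's inequality, $\mathbb{E} \exp s(X - \mathbb{E}X) \le \exp a s^2$ and by Markov's inequality

\[ \mathbb{P}(X - \mathbb{E}X \ge t) \le \inf_{s > 0} \exp(a s^2 - s t) = \exp\Bigl(-\frac{t^2}{4a}\Bigr). \]

Obviously, a similar inequality can be written for $\mathbb{E}X - X$ and, therefore,

\[ \mathbb{P}(|X - \mathbb{E}X| \ge t) \le 2 \exp\Bigl(-\frac{t^2}{4a}\Bigr). \]

The result follows.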
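The last two steps of the proof of Lemma 3 are routine; the worked lines below spell them out, assuming the tail bound $\mathbb{P}(|X - \mathbb{E}X| \ge t) \le 2\exp(-t^2/(4a))$ for all $t \ge 0$.

```latex
% Optimizing the exponent a s^2 - s t over s > 0: the minimum is at s = t / (2a),
\[
a \Bigl(\frac{t}{2a}\Bigr)^2 - \frac{t}{2a}\, t = \frac{t^2}{4a} - \frac{t^2}{2a} = -\frac{t^2}{4a}.
\]
% Second moment from the tail bound, using E Y^2 = \int_0^\infty 2t P(|Y| >= t) dt:
\[
\mathbb{E}(X - \mathbb{E}X)^2
  = \int_0^\infty 2t\, \mathbb{P}(|X - \mathbb{E}X| \ge t)\, dt
  \le \int_0^\infty 4t\, e^{-t^2/(4a)}\, dt
  = 4 \cdot 2a = 8a,
\]
% since \int_0^\infty t e^{-t^2/(4a)} dt = 2a.
```

This recovers the constant $8a$ in the statement of the lemma.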